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Abstract Estimating the engagement is critical for hu¬ 
man - robot interaction. Engagement measures typi¬ 
cally rely on the dynamics of the social signals ex¬ 
changed by the partners, especially speech and gaze. 
However, the dynamics of these signals is likely to be 
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influenced by individual and social factors, such as per¬ 
sonality traits, as it is well documented that they criti¬ 
cally influence how two humans interact with each other. 
Here, we assess the influence of two factors, namely 
extroversion and negative attitude toward robots, on 
speech and gaze during a cooperative task, where a hu¬ 
man must physically manipulate a robot to assemble 
an object. We evaluate if the score of extroversion and 
negative attitude towards robots co-variate with the 
duration and frequency of gaze and speech cues. The 
experiments were carried out with the humanoid robot 
iCub and N=56 adult participants. We found that the 
more people are extrovert, the more and longer they 
tend to talk with the robot; and the more people have 
a negative attitude towards robots, the less they will 
look at the robot face and the more they will look at 
the robot hands where the assembly and the contacts 
occur. Our results confirm and provide evidence that 
the engagement models classically used in human-robot 
interaction should take into account attitudes and per¬ 
sonality traits. 

Keywords Human-robot interaction • social signals • 
engagement • personality 

1 Introduction 

Service and personal robots must be capable of cooper¬ 
ating and interacting with humans for a variety of tasks. 
The robot’s social skills are crucial to prevent the inter¬ 
action to become cumbersome and the cooperation less 
effective. Social signals, i.e., verbal and non-verbal cues 
produced by the human and directed towards the robot, 
may reveal the engagement and ease of the person dur¬ 
ing the task, whether or not a physical interaction is 
entailed [5|I26| I10|. 

The ability to estimate engagement and regulate so¬ 
cial signals is particularly important when the robot 
interacts with people that have not been exposed to 
robotics, or do not have experience in using/operating 
them: a negative attitude towards robots, a difficulty 
in communicating or establishing mutual understand¬ 
ing may cause unease, disengagement and eventually 
hinder the interaction. 

It seems therefore necessary to study how individ¬ 
ual and social factors influence the issue of social signals 
during human-robot interaction, together with their re¬ 
lations to acceptance and engagement. 

To evaluate the engagement during human-robot in¬ 
teraction, the most common metrics are based on the 
temporal dynamics of social signals, in particular gaze 
and speech mm- The exchange of gaze (mutual and 
shared), the contingency of reactions to speech and gaze 
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Fig. 1 Conceptual representation of the experiment: we study the relation of extroversion and negative attitude toward robots 
on speech and gaze during a cooperative assembly task. 


cues, the temporal dynamics of speech (utterance num¬ 
ber, frequency, duration) are among the most common 
indicators of engagement during dyadic tasks [26 . 

However, there is evidence from the psychology lit¬ 
erature that the dynamics of these social signals can be 
altered by individual factors nzMsmsi]: we refer here to 
the set of behavioral, emotional, and cognitive tenden¬ 
cies that people display over time and across situations 
and that distinguish individuals from one another, such 
as personality traits and social attitudes. The influence 
of personality traits on human behaviors during inter¬ 
actions with robots has been also documented in several 
studies [49llT3llTj . 

Two individual factors seem particularly interesting 
for HRI: extroversion, a personality traits that is as¬ 
sociated to positive emotions and social behavior [20] , 
and negative attitude towards robots [34] . a personal 
attitude that captures the projected anxiety of the per¬ 
son toward the interaction with a robotic device. Re¬ 
cent studies showed that there is a correlation between 
these traits/attitudes and the issue and dynamics of so¬ 
cial signals, in particular gaze and speech [3d. In this 
case, if they impact the issue of such social signals, they 
also affect the power of the metrics used as indicators 
of engagement. 

Following this line of thought, the goal of this work 
is to study the relation between individual factors (ex¬ 
troversion and attitude toward robots) and the dynam¬ 
ics of gaze and speech produced by the human during 
an interaction with a robot (see Figure [TJ. 

For this purpose, we designed a collaborative assem¬ 
bly task between a human and a robot. We made video 
and audio recordings (see Figure |2| of the interactions 
between the humanoid robot iCub and adult partic¬ 
ipants who previously submitted their questionnaires 
for evaluating the extroversion and negative attitude to¬ 
wards robot^] The questionnaire scores were later used 

1 In social psychology, there is a net distinction between 
personality traits and attitudes. Here, we use methods from 


to study the issue (frequency and duration) of utter¬ 
ances and gaze towards the robot issued by the human 
partner. Since our experiment also involved a physi¬ 
cal contact between the robot and the person during 
the assembly, we distinguished between gaze towards 
the robot face and gaze directed towards the robot’s 
hands, that perform the assembly thanks to the human 
guidance. 

Our study shows that, at least for the cooperative 
assembly task, there is a correlation between extrover¬ 
sion score and the speech frequency and duration, while 
the negative attitude is related to the duration of gaze 
towards the robot. To summarize: 

— the more one is extrovert, the more he/she will talk 
to the robot 

— the more one has a negative attitude towards a robot, 
the less he/she will look at the robot face and the 
more he/she will look at the robot hands, where the 
physical interaction for the assembly takes place 

As gaze and speech are the main social signals used 
to evaluate engagement m, we provide significant re¬ 
sults supporting the idea that engagement models used 
in HRI should take into account individual factors that 
can influence the production of such social signals. 

By gaining a deeper understanding of the inter¬ 
individual factors that influence the exchange of gaze 
and speech during cooperative tasks, we aim at improv¬ 
ing the design of robot controllers during social and 
physical interaction. More generally, we would like to 


differential psychology rather than social psychology: the dis¬ 
tinction between the two is not important, as long as the two 
factors are two characteristics of the individual that are eval¬ 
uated at a certain time prior to the interaction. We measured 
the attitude towards robots with the NARS questionnaire, a 
test that was created to capture the projected anxiety of the 
person before its interaction with the robot. We used it to 
evaluate an individual attitude prior to the direct interaction 
with the robot (participants filled the NARS questionnaire 
several days before the experiment - see details about the 
experimental procedure in Section 4.4). 
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turn our findings into implications for the design of 
robot controllers that can adapt to the individual dif¬ 
ferences of the human partners. 

2 Background 

2.1 Social signals: the building blocks for assessing 
engagement 

During interaction, a multitude of verbal and non-verbal 
signals are exchanged between the two partners. These 
so called social signals and their dynamics are the main 
bricks for the evaluation of the engagement in HRI. 

The engagement is defined as “the process by which 
individuals involved in an interaction start, maintain 
and end their perceived connection to one another” 
[46] . As discussed in [3], the engagement is related to 
the user experience, to the perceived control, feedback, 
interactivity, attention, and the fluctuations of the en¬ 
gagement during interaction are reflected into physio¬ 
logical changes and behavioral changes through verbal 
and non-verbal communication. 

A social signal may be defined as “a communicative 
or informative signal, or a clue which, directly or indi¬ 
rectly, provides an information about social facts, i.e. 
interactions, emotions, attitudes, valuations or social 
behaviors, social relations or identities” [38]. The scope 
of social signals potentially extends to a large variety of 
behaviors and expressions: gestures, facial expressions, 
postures, gazes, etc. Anzalone et al. (3 partition the 
set of metrics for engagement evaluation into static and 
dynamic features. The first set comprises focus of at¬ 
tention and gaze analysis, head and body postural sta¬ 
bility, with evaluation of pose and variance. The second 
set comprises joint attention, reaction times to atten¬ 
tion cues, imitation, synchrony and rhythm of interac¬ 
tion. 

To assess the engagement during HRI experiments 
and tasks, researchers usually considers a subset of these 
social signals (see Table [TJ, frequently focusing on gaze 
and speech. 

Gaze is one of the most important cues and carriers 
of information during the interaction. It is indeed well 
established that mutual gaze and eye contact are crucial 
during human-human interaction m ■■ the absence of 
eye contact at the right time, for instance at the end of 
a sentence, can be perceived as a withdrawal from the 
conversation and a sign of disengagement. Gaze in HRI 
can be analyzed differently depending on its direction 
and target. For example, during verbal interaction mi 
[25] or learning games [26] it can be mutual (when the 
robot and the human partner look at each other) or 
directed/joint (when the robot and the human look at 


the same object or in the same direction). A third type 
of gaze can be the one directed by the human towards 
the robot, that the latter can return or not, depending 
on its joint attention skills [45]. 

Speech, and more specifically the dynamics of ver¬ 
bal exchange (e.g., turn-taking), is the other most im¬ 
portant social signal for interaction, and it is a crucial 
indicator in the assessment of engagement mm- The 
metrics used for evaluating the engagement using this 
signal are for example the number, frequency and dura¬ 
tion of utterances mm, the reaction time to utterance 
cues [26]. Le Maitre and Chetouani [28] also proposed 
a qualitative distinction between language actions in¬ 
volving the locutions directed towards the robot, and 
those towards oneself. 

Body language, which includes non-verbal behaviors 
such as facial expressions, posture and gestures, can also 
convey the intention and the engagement of the human 
partner. For example, Sanghvi et al. [42] analyzed the 
individual postures (the inclination and curve of the 
back) and their changes to assess the engagement of 
children playing chess against a humanoid robot. The 
engagement was also studied in relation to positive fa¬ 
cial expressions (e.g., smiles rather than grins) 9], head 
movements such (e.g., nodding) [47] and gestures re¬ 
sponding responding to a robot cue SB- 

To summarize, there are numerous studies that char¬ 
acterize the engagement and the interaction between 
humans and robots through the analysis of verbal and 
non-verbal signals. However gaze and speech are the 
most common social signals used to evaluate the en¬ 
gagement , as clearly showed in Table [l] 

Since the engagement is a sort of emotional “state” 
of the human partner during the social interaction, and 
it may fluctuate over the interaction, it is interesting to 
study the temporal dynamics of the social signals and 
the salient events associated to their evolution during 
the interaction. To estimate the engagement in HRI us¬ 
ing the exchanged social signals, there are two main 
approaches in the literature. 

The first approach consists in assessing the engage¬ 
ment of the human partner in real time. For instance, 
with a probabilistic approach Ishii et al. m demon¬ 
strate that a certain sequence of three gazing primitives 
(towards the object designated by the agent, towards 
the agent and towards any other direction) can reliably 
predict the human subject’s withdrawal from an inter¬ 
action. In their experiment, the robot was introducing 
a new model of mobile phone, and a sequence of gaze 
towards the robot then twice towards unrelated objects 
was linked to a disengagement. 

In the second approach, that we may consider as 
“global”, the engagement is neither measured in real 
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Study 

Ref 

Social signals used to assess the engagement 

Castellano et al., 2009 

le] 

Gazes towards the robot 

Smiles 

Ishii et al., 2011 

|25j 

Gazes Towards the object the agent is talking about 

Gazes Towards the agent’s head 

Gazes Towards anything else 

Ivaldi et al., 2014 

m 

Reaction time to the robot attention utterance stimulus 

Time between two consecutive interactions 

Le Maitre and Chetouani, 2013 

m 

Utterance directed to the robot 

Utterance directed to self 

Rich et al., 2010 

|4lj 

Gazes Focused (man and robot are looking at the same object 

Gazes Mutual (man and robot look at each other) 

Utterance Adjacent (two successive locutions, produced one by the robot, the 
other by the human, separated by a maximum interval) 

Utterance Responses (the subject responds to the robot through a gesture or a 
very short verbal intervention) 

Sanghvi et al., 2011 

m 

Postures (curve and inclination of the back) 

Sidner et al., 2004 

m 

Gazes Shared (mutual or directed) 

Gazes Directed towards the robot without the latter looking at the human 

Sidner et al., 2005 

m 

Gazes Shared (mutual or directed) 

Gazes Directed towards the robot without the latter looking at the human 


Table 1 Social signals used in literature as metrics for the assessment of engagement. 


time, nor on time intervals, but on the interaction as a 
whole. For instance, Sidner et al. mm suggest a met¬ 
ric combining the shared gazing time and the time spent 
by the subject looking at the robot for the whole dura¬ 
tion of the interaction on one hand, and the assessment 
of the number of gazes that the participant returns to 
the robot during the same period of time on the other 
hand. With a similar approach, Rich et al. m devel¬ 
oped a composite index defined by the average time 
intervals between two social connection events between 
the robot and the user in the course of an interaction, 
where the robot had to teach the participant how to 
prepare cookies. According to the authors, the events 
were divided into four sorts: 1) directed gaze, 2) mutual 
gaze, 3) adjacent utterances when two are produced in 
succession, one by the robot and one by the participant, 
with a maximum time gap between them, and 4) the 
replies to the robot with a gesture or a very short ut¬ 
terance. In the aforementioned studies, the researchers 
also observed the effect of various cues of the robot (e.g., 
robot nodding vs. not nodding) on the engagement of 
the user during the interaction. A specific approach on 
the whole interaction was proposed by Le Maitre and 
Chetouani 128]: they proposed the ratio between the 
talking time directed towards the robot and the one to¬ 
wards oneself as indicator of engagement, with the ra¬ 
tionale that an increased verbalization directed towards 
the robot can be interpreted as a stronger engagement 
(whereas the more the people talked to themselves, the 
lesser the engagement). 

To summarize, both considering the whole interac¬ 
tion and thin slices of interaction, measuring the en¬ 


gagement in HRI relies on the dynamics of the ex¬ 
changed social signals, particularly gaze and speech. 

However, there are no models that take into account 
context, task, social or individual factors that may af¬ 
fect the production of such signals, and subsequently 
the evaluation of the engagement. 

To the best of our knowledge, the HRI literature 
considering the inter-individual differences (concerning 
the personality) or the attitude (positive or negative) 
towards robots in the production of those signals is 
scarce. When discussing models of engagement, the hu¬ 
man individual is considered as “abstract”, expected to 
produce the same social signals at the same rhythm, 
despite any inter-individual difference that may affect 
the communication, the establishment and the contin¬ 
uation of the social interactions. 

It is however rational to consider that there can 
be personality traits, dispositions or attitudes that can 
make people talk, look and behave in a different way 
when facing the same interaction, especially with a de¬ 
vice such as a robot. For example, an introvert indi¬ 
vidual may talk less to or look less at the robot than 
an extrovert individual, without however being neces¬ 
sarily less engaged than the other. An individual with 
negative attitude towards robot may look less at the 
robot face, and look more at the robot’s body, especially 
during close or physical interactions with the robot. In 
short, the effect of personality characteristics and of the 
attitudes towards robots could impact the dynamics of 
social signals, and subsequently undermine the metrics 
and models used in the literature to assess the engage¬ 
ment in HRI. 














Title Suppressed Due to Excessive Length 


5 


2.2 Personality traits and attitudes 

As explained by Ajzen [24] . “attitudes and personality 
traits are latent, hypothetical dispositions that must be 
inferred from observable responses”. Their effect should 
be therefore observable on the overt actions of the in¬ 
dividual. The boundary between traits and attitudes is 
under debate; however it is acknowledged that both at¬ 
titudes and personality traits influence our actions and 
behaviors, together with other social, contextual and 
individual factors |44| . To make it simple, a personal¬ 
ity trait is a characteristic of the human personality 
that leads to consistent patterns of behaviors, and is 
assumed to be almost invariant for an adult. An atti¬ 
tude is a behavior tendency, directed towards people, 
objects, situations, and is generally determined by the 
social context, the background and experiences of the 
individual [54] . 

2.2.1 Personality trait: extroversion 

The personality of an individual consists of several char¬ 
acteristics and dispositions, each being described as a 
“gathering of attitudes obviously linked to each other, 
or as patterns of cognitive treatment of the information 
or underlying psycho-physiological mechanisms gener¬ 
ating specific dispositions towards some behaviors” f [44] . 

p.116). 

Among the existing personality models, the most 
well-known and studied is the Big Five [12], which owes 
its name to the five traits descriptive of a personal¬ 
ity: Extroversion , Neuroticism , Agreeableness , Consci¬ 
entiousness , Openness to Experience. This model is widely 
used in psychology to predict human behavior and cog¬ 
nition [55130], and is more and more also used in human- 
robot interaction EMM]- 

The extroversion dimension is the personality trait 
that notably (i) shows up more clearly during interac¬ 
tion, and (ii) has the greater impact on social behavior 
with respect to the other traits [57]. It is linked to pos¬ 
itive emotions, and identified through the tendency to 
be sociable, talkative, and self confident [T2] , It seems 
to be fundamental to shape the way people interact 
m and to establish and maintain social relations m • 
Beatty et al. 0 suggest that extroversion is one of the 
three major factors, together with neuroticism and psy- 
choticism, that have some bearing on communication. 
Moreover, it would also have an impact on the way in¬ 
dividuals behave, and even on the quality of new social 
relations [71- 

Alt hough there is evidence in social psychology about 
potential links between the emission of various social 
signals (verbal and non-verbal) and the personality pro¬ 


file [4], quantitative evidence is still needed. In particu¬ 
lar, the current knowledge about extroversion and the 
issue of verbal and non-verbal signals is mostly limited 
to verbal dyadic and group interactions where there is 
typically no physical contact. 

Generalizing and characterizing the influence of in¬ 
dividual differences and extroversion on verbal and non¬ 
verbal behaviors (e.g., gaze, head movements) is diffi¬ 
cult D2I; however, the literature in human-human in¬ 
teraction reports some evidence that the production of 
gaze and speech correlates to the level of extroversion of 
the individuals. For example, the level of extroversion 
has an effect on the frequency and duration of gazes 
towards a person during face-to-face conversations [56]: 
extroverts gaze longer than introverts. In a similar way, 
Wu et al. [55] showed that extrovert individuals tend to 
focus their attention on the area of the eyes on pictures 
of human beings longer than introverts. The influence of 
personality traits, especially extroversion, on the gaze 
is also reported for non-social tasks such as fixating ab¬ 
stract images m ■ 

With regards to verbal communication, Costa et al. 
[121 noted that one of the most clear signs of extrover¬ 
sion for an individual is to be more talkative, which also 
leads to a lesser number of pauses during conversation 
[44] . Extrovert people would also tend to use shorter 
sentences at an increased rate than introvert people in 
informal situations involving another language [2] - The 
link between extroversion and speech dynamics was ex¬ 
ploited for automatic classification of personality from 
videos of interaction between small groups of people: in 
[37[[29] the authors showed that the talking and inter¬ 
action timing ratio are positively correlated to the level 
of extroversion. 

To summarize, there is evidence from the literature 
on the influence of the extroversion trait on the dynam¬ 
ics of gaze and speech in human-human interaction. 
This certainly biases the current metrics and models 
for assessing engagement, that do not take into account 
such individual factors [3|l41]. 

Extending such studies to human-robot interaction, 
with the variability of tasks, situations and robots, it is 
certainly challenging. In this paper, we provide evidence 
that the dynamics of gaze and speech is related to the 
extroversion during a human-robot assembly task. 

2.2.2 Negative attitude towards robots 

As the literature seems to allege, extroversion may bring 
up inter-individual communication differences during 
social interactions between humans. While aversion to¬ 
wards other people may be identified through the per¬ 
sonality models, there is currently no model that allows 
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us to assess the dislike of technology, and more specif¬ 
ically robots. An individual may appear to be very so¬ 
ciable, while very wary of technology. For robots, this 
evaluation seems particularly critical. Currently, robots 
are diffused in factories and service and are mostly used 
or operate by skilled people that received some robotics 
training (i.e., experts). However, robots are gradually 
becoming available and accessible outside the classical 
settings, to ordinary people that have not received any 
robotics training (i.e., non-experts). Ordinary people 
without a proper knowledge of the platform are not 
typically aware of the limits and the real capabilities 
of the robots, because of their lack of prior experience 
with them and frequently limited background knowl¬ 
edge. Some people might be technophobic, some might 
have developed an anxiety towards robots, influenced 
by some recent trends in the public medis^] some may 
be influenced positively or negatively by movie^] and 
literature m • This a priori may reflect in differences 
in their behavior and communication, and not be de¬ 
pendent necessarily by their personality traits. 

It seems therefore necessary to take into account a 
personality characteristic that is related more to tech¬ 
nology rather than human beings, and more particu¬ 
larly to social robots and humanoids. 

This category of robots has been recently studied 
to better understand the reasons that may cause neg¬ 
ative attitudes towards this “too human-like” technol¬ 
ogy @3]. The most known negative effect linked to the 
robot appearance is the so called “Uncanny Valley” ef¬ 
fect: described by Mori in 1970, it describes the fact 
that a robot excessively “human-like” arouses a sense 
of unease and repulsion, whereas robots with a moder¬ 
ate level of human likeness or humanoids that can be 
clearly identified as machines arouse more affinity [32]. 
While numerous studies show that the humanoid ap¬ 
pearance is accountable for opinions and attitudes to¬ 
wards the robots [21], other factors also seem to affect 
these attitudes: movements speed and profiles, distance 
during the interaction, voice and temporal dynamics 
of verbal exchanges between the human and the robot. 
From a methodological point of view, attitudes towards 
the robots are usually assessed through free verbaliza¬ 
tion (e.g., interviews) and attitude scales. Nomura and 
colleagues [35|l33] developed a questionnaire for the val- 

2 See for example the press article: “Will workplace robots 
cost more jobs than they create?” http://www.bbc.com/ 
news/technology-27995372 

3 We interviewed our participants after the experiments. 
Some reported that they “do not like robots because they 
are going to take our jobs”. Some reported to have enjoyed 
the experiment with the robot and made explicit reference 
to their expectations being influenced by “the robots of Star 
Wars”. 


uation of negative attitudes towards humanoid robots: 
the Negative Attitude towards Robots Scale (NARS). 
In a series of studies, they could demonstrate the effect 
of a negative attitude towards robots on the communi¬ 
cation, in particular on the time of the verbal response, 
which increases with the more the negative attitude of 
an individual. 

It appears that a negative attitude towards robots 
has therefore an impact on the way people interact ver¬ 
bally with a robot. Someone with a more negative atti¬ 
tude towards robots may talk less to the robots: this 
could be misinterpreted as a sign of disengagement. 
Since speech dynamics is one the main indicators for 
engagement assessment, it should be recommended to 
take into account the impact of attitudes in the models 
for assessing the engagement based on the interpreta¬ 
tion of social signals emitted by the human during HRI. 

Incidentally, the influence of the negative attitude 
towards robots on social signals has been studied dur¬ 
ing interaction tasks with a significant verbal compo¬ 
nent, but not yet in tasks with physical interaction. 
However, since this attitude captures the worry of the 
person projected towards an interaction with a robot, 
we expect that its influence on the social signals will be 
more visible in tasks with contacts between the robot 
and human. In this case, the close proximity with the 
robot and the touch should highlight the unease and 
anxiety of the human. This effect was observed by Chen 
et al. in the robot nursing experiments m , where the 
authors showed that people with negative attitude to¬ 
wards robots responds less favorably to robot-initiated 
touch. Our intuition is that touching the robot in par¬ 
ticular should produce more distress, therefore making 
the humans gaze more at the body parts where the in¬ 
teraction occurs. 


3 The study 

3.1 Rationale 

There are several studies on the influence of individ¬ 
ual factors on the production of social signals during 
human-human interactions (for example, [291152] ). Re¬ 
cent studies on the link between personality traits and 
social signals have also appeared in the HRI community 
(for example, mm)- 

However, to the best of our knowledge there is no 
study yet examining the relation of individual factors to 
gaze and speech during an assembly task. In this type of 
cooperative tasks, the interaction between the human 
and a robot entails a physical and a social dimension. 
The contact with the robot (at the level of the hands, 
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Fig. 2 The experimental setup. The participant is standing in front of the robot iCub; their interaction is recorded by a 
Kinect, two standard HD cameras (front and side view of the scene). The experimenter monitors the interaction from the 
side, not too far but close enough to be able to push the safety button and intervene in case of emergencies. The operator 
is hidden behind a wall, and he controls the robot monitoring the interaction through a webcam placed over the robot. The 
power supply and cluster of the robot are hidden behind a cabinet. 


in this case) and the close proximity between the part¬ 
ners may induce variations of the production of gaze 
and speech with respect to simple face-to-face inter¬ 
actions with a predominance of verbal exchange. The 
alterations of the dynamics of the signals could be due 
to the task and/or to some characteristics of the indi¬ 
vidual, for example its personality or attitude towards 
robots. 

The engagement models do not currently differenti¬ 
ate between tasks with or without contact, and do not 
take into account individual factors that may induce 
changes in the dynamics of social signals. 

It is therefore necessary to provide evidence of the 
relation between these elements to improve the classi¬ 
cal models of engagement. We do it in this paper for a 
dyadic task that is fundamental for robotics in service 
and industry: the cooperative assembly. Furthermore, 
it seems necessary to take a comprehensive approach 
with respect to the individual factors, considering per¬ 
sonality traits and attitudes towards robots, as the per¬ 
sonality traits alone could not be sufficient to explain 
the variation of the social signals during an interaction 
with a robot. 


3.2 Research hypotheses 

Based on the literature review discussed in Section[2j we 
expect that participants that have high scores of extro¬ 
version will talk more to the robot; we also expect that 
participants with a very high negative attitude towards 


robots score will avoid gazing at the robot. Due to the 
specificity of the task, involving a contact between the 
human and the robot, we expect that participants with 
a high negative attitude towards robots will gaze more 
at the robot hands (area of contact between the human 
and the robot). 

Therefore, we pose five research hypotheses: 

Hypothesis 1: If the extroversion dimension is re¬ 
lated to the frequency and duration of utterances ad¬ 
dressed by the human to the robot, then we should find 
a positive correlation between the questionnaire score of 
extroversion and these variables. 

Hypothesis 2: If the extroversion dimension is re¬ 
lated to the frequency and duration of gazes directed 
towards the robot’s face, then we should find a positive 
correlation between the questionnaire score of extrover¬ 
sion and these variables. 

Hypothesis 3: If the negative attitude towards robots 
is related to the frequency and duration of the utterances 
addressed by the human to the robot, then we should find 
a negative correlation between the questionnaire score of 
the negative attitude towards robots and these variables. 

Hypothesis 4: If the negative attitude towards robots 
is related to the frequency and duration of gazes directed 
towards the robot’s face, then we should find a negative 
correlation between the questionnaire score of the neg¬ 
ative attitude towards robots and these variables. 

Hypothesis 5: If the negative attitude towards robots 
is related to the frequency and duration of gazes directed 
towards the areas of contacts between the human and the 
robot, then we should find a positive correlation between 
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the questionnaire score of the negative attitude towards 
robots and these variables. 

The hypotheses were tested through an interaction 
task where human participants had to cooperate with 
the humanoid robot iCub [33] to assemble an object. 
We made video and audio recordings of the interactions 
between the humanoid iCub and adult participants who 
previously submitted their questionnaires for evaluating 
the extroversion and negative attitude towards robots j^] 

This task was part of a set of experiments within 
the project “Engagement during human-humanoid in¬ 
teractions” (EDHHlfj to investigate the acceptance, 
engagement and spontaneous behavior of ordinary peo¬ 
ple interacting with a robot. The experimental proto¬ 
col used in this work (Ivaldi et al., “Engagement during 
human-humanoid interaction”, IRB n.20135200001072) 
received approbation by the local Ethics Committee 
(CERES) in Paris, France. 

4 Materials and methods 

4.1 Questionnaires 

To evaluate the extroversion and the attitude towards 
robots of the participants, we used two questionnaires: 
the Revised Personality Inventory (NEO-PIR) JT] and 
the Negative Attitude towards Robots Scale (NARS) 

EH- 

The first is used to assess the personality traits ac¬ 
cording to the Big Five model [20] . The official French 
adaptation of the questionnaire was used m We re¬ 
tained only the questions related to the assessment of 
the extroversion dimension, that is 48 questions divided 
into six facets: Warmth, Gregariousness, Assertiveness, 
Activity, Excitement seeking and Positive emotion^] 
The order of the questions followed the original ques¬ 
tionnaire; answers were on a Likert-type scale from 1 
(Totally disagree) to 5 (Totally agree). 

The second questionnaire consists of 14 questions 
divided in three sub-scales: “Negative attitude towards 

4 In social psychology, there is a net distinction personal¬ 
ity traits and attitudes. Here, we use methods from differen¬ 
tial psychology rather than social psychology. We measured 
the attitude towards robots with the NARS questionnaire, a 
test that was created to capture the projected anxiety of the 
person before its interaction with the robot. We used it to 
evaluate an individual attitude prior to the direct interaction 
with the robot (participants filled the NARS questionnaire 
several days before the experime nt - see details about the 
experimental procedure in Section |4.4|). 

5 http://www.loria.fr/~sivaldi/edhhi.htm 

6 We cannot report the questions, as the questionnaire is 
not publicly available: we refer the interested reader to the 
English manual m and the official French adaptation that 
we used El- 


situation of interaction with robots” (SI), “Negative 
attitude towards social influence of robots” (S2) and 
“Negative attitude towards emotions in interaction with 
robots” (S3). The order of the questions followed the 
original questionnaire; answers were on a Likert-type 
scale, from 1 (Strongly disagree) to 7 (Strongly agree). 
To the best of our knowledge, an official French adap¬ 
tation of the NARS questionnaire does not yet exist. 
For the experiments, we therefore proposed our French 
adaptation of the NARS questionnaire, taken from [35] . 
Our questionnaire was produced with a double transla¬ 
tion made by three different researchers, fluent in both 
English and French, and was validated by a group of 
ten external people to ensure that the French transla¬ 
tion was properly understooc0 We report the questions 
in both French and English in Table [6] in Appendix [8] 

The participants also filled up a post-experimental 
questionnaire for subjective evaluation of the assembly 
task with the robot. The questionnaire was designed to 
catch the impressions and feedback of the participants 
about the task, their interaction experience and in par¬ 
ticular the way they perceived the physical interaction 
with the robot. We report the questions in both English 
and French in Table [7] in Appendix [9] The order of the 
questions followed the original questionnaire; answers 
were on a Likert-type scale from 1 (Totally disagree) to 
7 (Totally agree). 

4.2 Experimental setup 

The experiments were conducted in the Institut des 
Systemes Intelligents et de Robotique (Paris, France), 
in the laboratory room of the iCub robot. 

The experimental setup was organized as depicted 
in Figure [2] The robot was standing on a fixed pole, so 
that it could not fall. The robot was semi-autonomous, 
i.e., it was controlled by an operator hidden behind a 
reflective wall (a plastic divider with reflective surface), 
built to prevent the participants to see the operator 
and the experimenter, while giving the experimenter 
the possibility to monitor the interaction and intervene 
promptly in case of problem^] 

7 A recent paper from Dinet & Vivian [15] studied the 
NARS and validated it on a sample of French population. 
Their study was published only after our work and experi¬ 
ments. They employed their own translation of the question¬ 
naire, which has some slight differences with ours, mostly due 
to some nuances of the French language. These do not pre¬ 
serve the original meaning when translated back into English. 
In their paper there is no mention of a double translation 
mechanism for validating the French adaptation of the ques¬ 
tionnaire. 

8 This was done as a safety measure. However, nothing hap¬ 
pened during the experiments: the experimenter never had to 
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Two cameras were recording the participants, as 
shown in Figure [2j One camera was placed behind the 
robot on its left side, in such a way to observe the hu¬ 
man face and upper-body during the close interaction 
with the robot, while the other one was placed laterally 
to take the scene as a whole. 

The colored rolls used for the assembly task are 
shown in Figure [3] 





Fig. 3 Colored paper rolls used in the assembly task. 


The experiments were carried out with the humanoid 
robot iCub [33] . The robot is approximately 104 cm 
high, weights about 24 kg, and has the shape of a 4 
years old child. 

To facilitate the control of the robot by the oper¬ 
ator, we developed a graphical user interface (GUI) 
to quickly send high-level commands to the robot in 
a wizard-of-Oz mode (WoZ). The operator was con¬ 
stantly monitoring the status of the robot, and could 
intervene to send high-level or low-level commands to 
the robot, in prompt response to unexpected actions or 
requests of the participants, using a dedicated graphical 
interface (see Appendix 10). 

The robot was always controlled in impedance m, 
to make it compliant in case people would touch it ac¬ 
cidentally or intentionally before the construction task. 
When people had to physically manipulate the robot 
to move its arms and accomplish the task, the operator 
was switching the robot into a zero-torque control mode 
that allowed the arms to be driven lightly by the par¬ 
ticipants. For safety issues, the operator could stop the 
robot motion at any time simply switching the robot 
to position control, and at the same time the experi¬ 
menter monitored the whole interaction and was able 
to intervene and stop the robot in case of urgency at any 
time using the robot safety button. Facial expressions 
and speech were enabled (more details in Appendix [To]). 
The robot always assumed the same neutral/positive 
expressions, to avoid confusing the participant or sug¬ 
gest an underlying robot “emotional status”. 


push the safety button, and she never had to stop the physical 
interaction between the robot and the subject. 


4.3 Participants 

Prospective participants were recruited through a generic 
announcement for HRI studies, diffused on a mailing- 
list. Participants that volunteered in the study received 
a 10 euros voucher as a symbolic reimbursement for 
travel expenses. They signed an informed consent form 
to partake in the study and granted us the use of their 
recorded data and videos. N=56 voluntary healthy adults 
took part in this study: 37 women, 19 men, aged 19 to 
65 (mean=36.95, cr=14.32). The participants were all 
native French speakers. 


4.4 Experimental procedure 

After volunteering to take part in the experiment, the 
participants received an ID number to preserve anonymity 
during the study. The personality traits of the partici¬ 
pants were retrieved by questionnaires that were filled 
up through an online web form two weeks before doing 
the experiment, to avoid influences of the questions on 
their behavior. 

The day of the experiment, participants were wel¬ 
comed by the researcher and informed about the over¬ 
all procedure before signing an informed consent form 
granting us the use of all the recorded data for research 
purposes. 

Before the experiment, the participants had to watch 
a short video presenting the iCub, its body parts and 
some of its basic movement^] The video did not provide 
any information about the experiments. It was instru¬ 
mental to make sure that the participants had a uniform 
prior knowledge of the robot appearance (some partic¬ 
ipants may have seen the robot before on the media). 

After the video, each participant was equipped with 
a Lavalier microphone to ensure a clear speech data col¬ 
lection, then was introduced to the robot. The exper¬ 
imenter did not present the experimental setup (e.g., 
show the location of the cameras) except showing the 
robot, and she did not provide any specific instruction 
to the participants about what to do or say and how 
to behave with the robot. Most importantly, she did 
not say anything about the fact that the robot was not 
fully autonomous: since the operator was hidden be¬ 
hind a wall, mixed with other students of the lab, the 
participant had no cue that the robot was controlled 
by someone elsj^j The robot was in a standing po- 

9 It is a dissemination video from IIT showing the iCub, 
available on Youtube: http://youtu.be/ZcTw02dpX8A. 

10 In the post-experiment interview, we asked the partici¬ 
pants if they thought or had the impression that the robot 
was controlled by someone: all the participants thought that 
the robot was fully autonomous. 
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sit ion, gently waving the hands and looking upright, 
while holding a colored toy in its right hand. Once the 
participants were standing and looking in front of the 
robot, they were free to do whatever they wanted: talk 
to the robot, touch it, and so on. 

The experimenter explained that the goal of the task 
was to create an object in collaboration with the robot. 
To create the object, they simply had to assemble two 
paper rolls and fix them with some tape. The partic¬ 
ipant could grab the robot arms to demonstrate the 
bi-manual movement necessary to align the two rolls, 
as shown in Figure [5j 

As the task required a physical interaction with the 
robot, for safety reasons the experimenter had to pro¬ 
vide a short demonstration to show the participant how 
to grab the robot arms in a safe manner and how to 
“move” the robot arms by guidance to teach the robot 
a desired movement^ This demonstration was neces¬ 
sary to make sure that the participants would grab the 
robot forearm on the cover parts covered by the skin, 
for their own security and to prevent damaging of ca¬ 
bles and robot hands (see Figure [4]). All the participants 
received the identical demonstration. To show a move¬ 
ment to the robot, the experimenter gently grasped the 
robot forearms touching the skin and saying “Be com¬ 
pliant” . The robot operator then switched the control 
mode of the robot arms to zero-torque control, so that 
the experimenter could gently move the arms. To make 
the arms hold the position, the experimenter said “Hold 
on ”. The operator then switched the control mode of 
the arms to impedance position control 

The short demonstration was necessary for safety 
reasons, because the participants were not robotics ex¬ 
perts. The experimenter precised that the demonstra¬ 
tion was not to be used as a template on how to perform 
the task with the robot, as neither the task nor the in¬ 
teraction were scripted and the robot would follow the 
participant’s guidance and commands. 

To accomplish the assembly task, the experimenter 
precised that it was necessary to explain to the robot 
how to realize the assembly step by step, even if no 
scripted procedure was provided. No explicit instruc¬ 
tions were given to the participants on how to explain 
the procedure to the robot. 

We remark that the interaction between participant 
and robot was not scripted, and our aim was to let it be 


11 The demonstration was also part of the safety measures 
required by the Ethics Committee to approve our protocol. 

12 The operator could switch the control mode without the 
need of the verbal command, since he had a direct visibility 
of the interaction zone in front of the robot through an addi¬ 
tional camera that was centered on the workspace in front of 
the robot (see Figure |2|. 



Fig. 4 Demonstration on how to safely grab the robot arms 
for kinesthetic teaching in the assembly task: the hands of 
the experimenter grasp the robot forearms on a part covered 
by the skin. On the left, the distributed tactile sensors un¬ 
derneath the cover. 

as much as spontaneous as possible for a first human- 
humanoid interaction. 

The experimenter then gave the participants the 
first two colored paper rolls and invited the participant 
to start the assembly task with the robot; the task had 
to be repeated three times with three pairs of paper 
rolls, so as to build three objects. The paper rolls and 
the tape were conveniently placed on a table next to 
the participants. The participant was free to start at 
his/her own convenience, and to make each trial last 
how much he/she wanted to. Some paper rolls used in 
the experiments are shown in Figure [3j 

Once the participants finished the assembly task, 
repeated three times, the experimenter led the partic¬ 
ipant back to a computer to make him/her fill a post¬ 
experiment questionnaire and then get feedback and 
impressions through a short interview. 

4.5 Data analysis 

The questionnaires scores for extroversion and NARS 
were computed according to their authors’ recommen¬ 
dation. 

The audio-video recordings were analyzed with CowLog 
software [22] . Six events were annotated: beginning of 
the interaction, end of the interaction, beginning of 
a gaze by the participant towards the robot’s face or 
hands (i.e., the contact area), end of that gaze, begin¬ 
ning of an utterance addressed to the robot, end of that 
utterance. The gaze direction was approximated by the 
head orientation, as it is often done in literature [26, 

Hj. We considered two consecutive utterances whenever 
there was a pause of at least 500ms. 

We computed from the events’ timestamps the fol¬ 
lowing six dependent measures: frequency and duration 
of gaze towards the robot’s face, frequency and dura¬ 
tion of gaze towards the robot’s hands, frequency and 
duration of utterances addressed to the robot. These 
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Fig. 5 Demonstration of the assembly task: 1) the participant asks the robot to grasp the two cylinders; 2) the participant 
grabs the robot arms and demonstrates how to move them to align the two cylinders; 3) the participant fixes the cylinders 
with some tape while the robot is holding them; 4) the participant retrieves the assembled object from the robot. 



Fig. 6 Some participants gazing at the robot face. From left to right: when the participant meets the robot, handing the 
cylinders, during the alignment of cylinders, and when the object is built. 
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Fig. 7 Some participants performing the assembly task (screenshots from the front camera). The three images show the 
participants giving the cylinders to the robot (left), grabbing the robot arms (center) then moving the arms to align the 
cylinders (right). 



Fig. 8 Some participants showing the final object to the robot, after the collaborative assembly. 
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Post-experimental questionnaire for human-humanoid collaborative tasks with physical interaction 


Questionnaire Item 

Subjective evaluation 

(score mean ± stdev) 

Questions related to the task 

The assembly task was easy to do. 

5.49 ± 1.39 

The assembly task was interesting to do. 

5.75 ± 1.61 

Someday I could work with this robot to build something of interest. 

5.03 ± 1.67 

Someday I could work with a robot to build something of interest. 

5.87 ± 1.07 

Questions related to the physical interaction (e.g., touching the robot) 

I was worried to must touch the robot to assembly the objects with it. 

2.13 ± 1.46 

I was afraid to touch the hands of the robot. 

2.36 ± 1.72 

I was afraid to damage the robot. 

3.57 ± 1.91 

The robot does not look dangerous. 

6.00 ± 1.57 

The robot is not threatening. 

6.02 ± 1.49 

Questions related to the cognitive/social interaction 

During the assembly, I would have preferred that the robot tells me what it thinks, if 

5.19 ± 1.61 

it understands well. 


The robot understood what I explained to it. 

5.38 ± 1.39 

The robot should be more reactive. 

4.65 ± 1.56 

The robot was nice. 

5.49 ± 1.37 

Questions related to the robot features 

The robot moves its head too slowly. 

3.32 ± 1.41 

The robot moves its arms too slowly. 

3.55 ± 1.33 

The facial expressions of the robot trouble me. 

2.03 ± 1.29 

The voice of the robot is pleasant. 

4.51 ± 1.84 


Table 2 The scores of the post-experimental questionnaire for evaluating the perception and interaction with the iCub in the 
assembly task of this work. The second column reports the mean and standard deviation of the scores attributed on a 7-items 
Likert scale (from l=totally disagree to 7=totally agree) by the N=56 participants in this study. We highlight in bold the 
questions where the score is close to the maximum or the minimum score. 


indicators were normalized by the total duration of the 
interaction, to take into account inter-individual vari¬ 
ability in terms of task duration. 

We used Pearson’s correlation to test of correlation 
of the extroversion and attitude towards robots on the 
frequency and duration of gaze and utterance^] 

5 Results 

The average time to complete the task was 246.10s 
(cr= 75.45). On average, the participants talked to the 
robot for 69.92s (a= 38.38), addressing to it 57.54 ut¬ 
terances (<r= 25.65); they looked at the robot’s face for 
42.55s ((7=29.25), gazing at the face of the robot 12.13 
(cr=6.57) times; they looked at the robot’s hands for 
162.46s (cr=57.14), gazing at the hands 11.30 ((7=5.70) 
times. 

5.1 On the individual factors 

To ensure that the two questionnaires capture two dif¬ 
ferent individual factors, we computed the correlation 

13 Correlation is frequently used to study the link between 
personality and behavior, as discussed in HU, a survey on the 
link between extroversion and behavior where all the cited 
studies use correlations to test their hypothesis. 


between the scores of extroversion and negative atti¬ 
tude towards robot obtained by our population of par¬ 
ticipants. We did not find a significant correlation be¬ 
tween the two (r 2 =-0.213; p=N.S.), neither between ex¬ 
troversion and each of the three sub-scales: negative 
attitude towards interaction with robots (r 2 =-0.156; 
p=N.S.), negative attitude towards social influence of 
robots (r 2 = -0.156; p=N.S.), and negative attitude to¬ 
wards emotions during the course of interactions with 
robots (r 2 =-0.254; p=N.S.). 

These results seem to indicate that both question¬ 
naires represent a fair valuation of the different individ¬ 
ual traits of the participants. 


5.2 Relation of extroversion to gaze and speech 

The participants’ average extroversion score was 111.77 
(cr=22.86; min=61, max=160), which is, according to 
a neutral level of extroversiorp^j 
Table [3] reports the Pearson’s correlation between 
the extroversion score of the participants and their gaze 
and utterance frequency and duration. The extrover¬ 
sion score is significantly and positively correlated to 

14 According to the NEO-PIR, a participant obtaining a 
score bigger than 137 is considered extrovert, while one with 
a score below 80 is introvert. 
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Fig. 9 Scatter graphs showing the frequency (num¬ 
ber/seconds) and duration (normalized ratio) of utterances 
of the participants (N=56), in function of their extroversion 


the frequency and duration of utterances (see Table [3]). 
This can also be seen in the scatter graphs in Figure[9j 
Conversely, the results indicate that extroversion does 
not influence the gaze signal, as there is no significant 
correlation between the personality trait and the gaze 
frequency or the duration of gaze. 


To summarize, the more an individual is extrovert, 
the more he/she will tend to talk to the robot during an 
assembly task to provide instructions. On the contrary, 
an individual with a high score of extroversion will not 
look at the robot’s face or hands more than individuals 
with lower scores. 


Therefore, with reference to the research hypothesis 
expressed in Section 3.2, we confirm Hypothesis 1, and 
reject Hypothesis 2. 


5.3 Relation of negative attitude towards robots to 
gaze and speech 

The participants’ average score for the negative atti¬ 
tude was 45.55 (<7=12.74; min=20, max=77), which is 
a neutral value for the attitude towards robotJ^I 

Table [6] reports the Pearson’s correlation between 
the NARS scores of the participants and their gaze and 
utterance frequency and duration. The results indicate 
that the negative attitude does not influence the ver¬ 
bal signal, as there is no significant correlation with 
the utterance frequency or duration. There is, how¬ 
ever, a partial effect on the gaze signal. Precisely, the 
negative attitude is significantly and negatively related 
to the duration of gaze towards the robot’s face, and 
positively related to the duration of gaze towards the 
robot’s hands, as visible in Figure [To| 

To summarize, the more an individual has a nega¬ 
tive attitude towards robots, the less he/she will tend 
to look at the robot’s face during an assembly task, and 
the more he/she will tend to look at the robot’s hands 
(area of physical contact). The gaze frequency, on the 
contrary, will not change in relation to different posi¬ 
tive or negative attitudes. Nothing can be concluded re¬ 
garding the verbal communication: an individual with 
a more negative attitude towards robots will not speak 
more or less to the robot than other individuals with a 
more positive attitude. 

Therefore, with reference to the research hypothesis 
expressed in Section |3.2[ we reject Hypothesis 3 and 
partially confirm Hypothesis 4 and 5, since the NARS 
score relates to the gaze duration but not to the gaze 
frequency. 

As explained in Section |4j the NARS questionnaire 
is based on three sub-scales. The participants’ average 
scores of negative attitude towards interaction situa¬ 
tions (SI), social influence of robots (S2) and emo¬ 
tions during interaction (S3) were respectively 15.18 
(<r=5.83), 18.80 (<7=5.83) and 11.70 (cr=3.82), whereas 
the mean values of the three sub-scales were 24, 20 and 
12. We performed a thorough investigation of the effect 
of each of the three sub-scales on gaze and utterances. 
For the gaze signal, we did not find any significant corre¬ 
lation between the sub-scales values and its frequency; 
however, we found a significant and negative correlation 
between the gaze duration and SI (r 2 =-0.311; p < 0.05) 
and S2 (r 2 =-0.334; p < 0.05). For the verbal signal, 
we did not find any significant correlation between the 
sub-scales values and the utterance duration, however 


15 According to the NARS, a score over 65 is a sign of nega¬ 
tive attitude towards robots, while a score below 35 indicates 
a rather positive attitude towards robots. 
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Variable 

Extroversion score 

Gaze towards face frequency 

Gaze towards face duration 

r 2 = -0.13 ; p=0.927 (N.S.) 
r 2 = 0.098 ; p=0.471 (N.S.) 

Gaze towards hands frequency 

Gaze towards hands duration 

r 2 = 0.058 ; p=0.671 (N.S.) 
r 2 = 0.215 ; p=0.875 (N.S.) 

Utterance frequency 

Utterance duration 

r 2 = 0.318 ; p=0.017 (<0.05) 
r 2 = 0.321 ; p=0.016 (<0-05) 


Table 3 Correlation between the participants’ extroversion score (computed by NEO-PI-R [T2] ) and their gaze and utterance 
frequency (number/s) and duration (normalized ratio) during the assembly task. 


Variable 

Negative attitude towards robots score (NARS) 

Gaze towards face frequency 

Gaze towards face duration 

r 2 = -0.174 ; p=0.201 (N.S.) 

r 2 = -0.331 ; p=0.013 (<0.05) 

Gaze towards hands frequency 

Gaze towards hands duration 

r 2 = -0.111 ; p=0.413 (N.S.) 

r 2 = 0.355 ; p=0.007 (<0.05) 

Utterance frequency 

Utterance duration 

r 2 = -0.137 ; p=0.314 (N.S.) 
r 2 = 0.033 ; p=0.807 (N.S.) 


Table 4 Correlation between the participants’ negative attitude towards robots score (computed by NARS (34] 5 and their 
gaze and utterance frequency (number/seconds) and duration (normalized ratio) during the assembly task. 


Variable 

NARS-S1 

NARS-S2 

NARS-S3 

Gaze towards face frequency 

Gaze towards face duration 

r 2 =-0.160; p=0.238 
(N.S.) 

r 2 =-0.311; 
p=0.020 (<0.05) 

r 2 =-0.215; p=0.111 
(N.S.) 

r 2 =-0.334; 
p=0.012 (<0.05) 

r 2 =0.009; p=0.947 

(N.S.) 

r 2 =-0.120; p=0.377 
(N.S.) 

Gaze towards hands frequency 

Gaze towards hands duration 

r 2 =-0.073; p=0.592 
(N.S.) 

r 2 =0.381; 
p=0.004 (<0.05) 

r 2 =-0.138; p=0.310 
(N.S.) 

r 2 =0.334; 
p=0.012 (<0.05) 

r 2 =-0.043; p=0.754 
(N.S.) 

r 2 =0.094; p=0.491 

(N.S.) 

Utterance frequency 

Utterance duration 

r 2 =0.018; p=0.895 

(N.S.) 

r 2 =0.172; p=0.203 

(N.S.) 

r 2 =-0.093; p=0.497 
(N.S.) 

r 2 =0.058; p=0.673 

(N.S.) 

r 2 =-0.323; 
p=0.015 (<0.05) 

r 2 =-0.249; p=0.063 
(N.S.) 


Table 5 Correlation between the scores of the NARS sub-scales (computed as in [34]) °f the participants and their gaze and 
utterance frequency (number/seconds) and duration (normalized ratio) during the assembly task. 


we found a significant negative correlation between S3 
and the utterance frequency (r 2 =-0.323 ; p < 0.05). 

To summarize, the more people display a negative 
attitude in the interaction (SI) with the robot and are 
concerned by the social aspect (S2) of the interaction, 
the less they will look at the robot. Conversely, the 
more people are negative about the emotions during the 
interaction (S3), the less they will talk to the robot. 

5.4 Post-experiment evaluation 

The post-experimental questionnaire for subjective eval¬ 
uation does not have a score. It was designed by the 
experimenter to get a simple feedback on the user expe¬ 
rience, retrieve the global impression and the personal 
evaluation of the participants on some aspects of the 
task. Table [2] reports on the average score for each item 
in the questionnaire. We highlighted in bold the most 
significant questions, which have an average score that 
is close to the maximum (7) and minimum (1). 


6 Discussion 

As discussed in Section [2j the literature in psychology 
highlights an effect of personality traits, particularly of 
the extroversion dimension, on the dynamics of speech 
and gaze. Likewise, a negative attitude towards robots 
will influence the time of the verbal response during 
interactions. These results induced us to question the 
reliability of the metrics used for the estimation of the 
engagement in HRI, classically based on the dynamics 
of social signals, as their dynamics may be altered by 
individual factors that are not taken into account in the 
models of engagement. 

In the following we discuss here the results on the 
correlations between two individual factors (extrover¬ 
sion and negative attitude towards robots) and the dy¬ 
namics of speech and gaze observed during the human- 
robot assembly task. We argue about the implications 
of our study for the HRI community and consider the 
limits of our study. 
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let people gaze at the robot face more frequently. An¬ 
other element that might explain this result is the lack 
of a proper joint attention system implemented on the 
robot for this experiment, particularly for mutual gaze: 
once the human touched the robot arms to start its 
kinesthetic demonstration, the robot was simply shift¬ 
ing its gaze from the human face to its own hands, and 
was not seeking eye-contact during the teaching phase. 
However, since the participants were all in the same 
conditions, a correlation between gazing behavior and 
extroversion should have been detected. 



Fig. 10 Scatter graph showing the duration of gaze (nor¬ 
malized ratio) of the participants (N=56) towards the robot 
hands and face, in function of their NARS score. 

6.1 Extroversion & social signals 

As detailed in Section [5T2| we found that there is a pos¬ 
itive and significant correlation between the extrover¬ 
sion score and the frequency and duration of utterances. 
The more the individual is extrovert, the more often 
and longer he/she will tend to address the robot during 
the interaction. This result is consistent with observa¬ 
tions of human-human interactions, showing that intro¬ 
verts tend to talk less than extroverts [44]. Conversely, 
we did not find a significant correlation between the 
extroversion and the gaze frequency or duration. This 
finding is partially contrary to what has been observed 
in |56], where the author found a relationship between 
the extroversion and the amount of time spent gazing 
while listening. However, the author also observed that 
the gaze duration was not related to extroversion when 
people were speaking. Since in our task, the partici¬ 
pants were supposed to talk to the robot to explain 
the task, we can presume that this could be one possi¬ 
ble cause of the non-effect of the extroversion on gaze 
duration. Furthermore, our assembly task induced the 
participants to focus their attention also on the robot 
hands, while we can presume that a different task will 


6.2 Negative attitude towards robots & social signals 

As presented in Section [53] we found that the negative 
attitude towards robotics tends to be related to the time 
spent looking at the robot’s face and the robot’s hands 
during the interaction (Table [6]). 

Overall, the participants were probably not appre¬ 
hensive facing the interaction, while they were likely 
mildly concerned regarding the social and emotional 
aspect of the interaction. With a deeper look at the 
NARS sub-scales (Table [5]), we found that the more 
one has a negative attitude towards the interaction sit¬ 
uation (SI) and the social influence of the robot (S2), 
the shorter it will look at the robot face. These results 
may indicate that people will look less at the robot as 
symptom of their aversion towards the robot as social 
agent, or because of their anxiety in interacting with it. 
This is consistent with the duration of gaze directed to¬ 
wards the robot’s hands: it makes sense that the more 
one has a negative attitude towards interacting with 
a robot, the more he/she will spend time looking at 
the robot’s hands in a task where there is physical in¬ 
teraction with the robot occurring at the hands level. 
Interestingly, these dimensions (SI and S2) do not seem 
to have influence on the speech production. Conversely, 
people concerned with emotional robots (S3) will tend 
to have less verbal exchange with the robot. 

We found significant correlations for the gaze dura¬ 
tion, but not for the gaze frequency: this result could 
be slightly biased by the lack of mutual gaze exhib¬ 
ited by the robot. We expected that an individual with 
positive attitude would look more at the face trying to 
engage and get the robot’s attention, while an individ¬ 
ual with negative attitude would have the tendency to 
avert his/her look towards the robot face. However, the 
lack of a joint attention mechanism can explain the low 
number of gazes towards the robot face (12.13 zb 6.57) 
and the fact that they do not seem to be correlated 
with the negative attitude. 

We did not find any significant correlation with the 
verbal signal. Our results seem to contradict those of 
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[36] , that brought evidence that a negative attitude to¬ 
wards robots had repercussions on the timing of the 
verbal response. However, in their study the authors 
were focusing on reaction times to robot’s stimuli, not 
on the frequency or duration of utterances. Looking at 
the NARS sub-scale, we found a significant correlation 
between the negative attitude towards emotions (S3) 
and the utterance frequency. This result is in line with 
[34] . where the authors highlight the stronger negative 
attitude towards emotions (S3) for individuals dealing 
with small-sized humanoids robots, which corresponds 
to the case of our robot iCub. 

Overall, we expected the negative attitude to have 
a stronger influence on the amount of verbal and non¬ 
verbal signals exchanged during the interaction. We ex¬ 
pected that the physical contact with the robot and 
the close interaction would particularly highlight the 
effect of the negative attitude. We speculate that this 
result could be influenced by a social desirability bias: 
the participants maybe tried to perform better as sub¬ 
jects in the study, eventually behaving in a forced way. 
The positive evaluation that we retrieved from the post- 
experimental questionnaire (Table [2| could also be par¬ 
tially related to that. 

As we found few studies dealing with attitude to¬ 
wards robots and social signals, this part of our work 
may be considered as exploratory. 


6.3 Subjective impressions 

Overall, the subjective evaluations and the feedback 
from the interviews encourage us to think that the in¬ 
teraction with the robot was pleasant and the partici¬ 
pants were spontaneous in their behavior. With refer¬ 
ence to the subjective evaluations scores in Table [2] the 
participants evaluated positively the experiment with 
the robot and the robot itself. We highlighted in bold 
the questions where the average score is close to the 
maximum (7) or minimum (1) score: this provides a 
rough indication of the tendencies of the participants. 
They found the task quite interesting and easy to do, 
and they also had a positive impression of the robot. 
Interestingly, they were not afraid to touch or interact 
physically with the robot (e.g., not worried to touch the 
robot, not afraid to touch the hands). Also the robot 
was not looking dangerous to their eyes. Considering 
that the experiment was their first live interaction with 
the robot, this score was quite surprising for us: we ex¬ 
pected the novice/naive people to report some anxiety 
in front of the robot. However, when we interrogated 
the participants about this, most of them said that 
the safety demonstration reassured them about the fact 


that it was possible to touch the robot without prob¬ 
lems; others said that the robot size and child-like ap¬ 
pearance made them suppose that it was safe to touch 
it as the robot “won’t hurt”. We asked to the partic¬ 
ipants if they thought or had the impression that the 
robot was operated by someone else: ah the participants 
denied this possibility. Almost ah the participants asked 
us if the robot had learned correctly what they had been 
teaching. 

6.4 Implications for automatic personality assessment 

Social robots should be able to adapt their behaviour 
taking into account the unique personality of their in¬ 
teracting partners [2]. To this end, they need to learn a 
model of their behaviour, that can be built using mul¬ 
timodal features extracted during online interaction, 
physical features, social context, psychological traits of 
the partner such as personality or attitudes etc. Cur¬ 
rently, a crucial challenge in HRI is the automated on¬ 
line estimation of these individual factors: for personal¬ 
ity traits, in the personality computing literature this is 
called Automatic Personality Recognition, which aims 
at “inferring self-assessed personalities from machine 
detectable distal cues” (see [52] for an exhaustive re¬ 
view). Since personality and individual traits influence 
the production of verbal and non-verbal signals, it is im¬ 
portant to gain more quantitative knowledge on their 
relations to be able to produce predictive models that 
can be used to improve the HRI experience. For ex¬ 
ample, Tapus et al. mu] showed that an adaptive 
robot matching the personality of the patient is bene¬ 
ficial for assisted therapy, and that extrovert/introvert 
people prefer to interact with robots that exhibit extro¬ 
vert/introvert personality features pQ. 

Thanks to the findings of our work, we now have a 
quantitative indicator for estimating the extroversion of 
a human interacting with a robot in a collaborative as¬ 
sembly task, by looking at the his/her speech dynamics. 
At the same time, we can derive a simple linear model 
for estimating the NARS based on the duration of gaze 
towards the robot face. 

We are extending these findings to the other exper¬ 
iments of the EDHHI project, for example we already 
showed that it is possible to predict extroversion from 
non-verbal features during a thin slice of simple face- 
to-face interaction 189] . 

6.5 Implications for measuring engagement 

Our goal in this paper is not to measure the engagement 
of a particular HRI situation, but to provide quantita- 
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tive evidence that the computational models of engage¬ 
ment should take into account individual factors. Such 
models are commonly based on the dynamics of sig¬ 
nals such as gaze and speech wmm- The engagement 
metrics may be biased by individual factors such as ex¬ 
troversion and negative attitude towards robots, factors 
that we have not met in the engagement literature for 
an assembly task such as the one presented in this pa¬ 
per. Our results indicate that extroversion and negative 
attitude towards robots are related to the temporal dy¬ 
namics of gaze and speech during a human-robot in¬ 
teraction. If the engagement depends on the frequency 
or rhythm of such social signals [4Tj . then an introvert 
individual or one with negative attitude towards robots 
will be considered as less engaged than an extrovert or 
one with a positive attitude, simply because the first is 
more likely to produce less signals (gaze or utterances) 
than the second. The design of robust models of en¬ 
gagement should therefore take these individual factors 
into account. 

We further notice that the models for evaluating the 
engagement refer mostly to dyadic tasks without phys¬ 
ical interaction. For tasks such as the one of this paper, 
the cooperative assembly may induce the people to gaze 
more at the hands and at the objects than in other tasks 
where there is no co-manipulation. Therefore, there is 
a potential risk that the estimated engagement of the 
HRI may be partially biased by the “task engagement”. 
We will perform the study with other tasks to verify, 
because the current results are not sufficient to pro¬ 
vide conclusions on this matter. This problem actually 
highlights a weakness of the models used for the eval¬ 
uation of the engagement which are uniquely based on 
the dynamics of social signals. 

6.6 Implications for human-robot physical interaction 

The underlying idea in our work is that by studying the 
factors that influence the production of social signals 
together with the exchanged forces, one can improve the 
design of robot controllers during physical interaction 
and, for example, achieve better performances during 
cooperative tasks. 

More and more people are going to interact physi¬ 
cally with robots, for a variety of tasks: from co-working 
in manufacturing, to personal assistance at home. This 
requires for the robot the ability to control precisely the 
interaction forces, but also to be able to interact in a 
“social” way, adapting to the reaction of each individ¬ 
ual, so that people can trust the robot, accept it and 
be engaged interacting with it. 

Together with the contact forces, it is therefore nec¬ 
essary to study the other verbal and non-verbal signals 


that are exchanged during the physical interaction, such 
as gaze, prosody, gestures, etc. All these signals can be 
used to study the comfort and the engagement of the 
people interacting with the robot, providing the nec¬ 
essary feedback for the robot to adapt its action. Re¬ 
searchers studying cooperative tasks usually focus on 
sequencing and patterns of cooperation [53], adapta¬ 
tion of roles and physical forces [48], while the social 
signals emitted during such tasks are not fully explored. 
Conversely, the dynamics of social signals, such as gaze 
and speech, is mostly studied during tasks that do not 
involve a direct physical interaction, such as dialogues 
and games [8j[3l[9]. 

In this paper, we provide some evidence about the 
dynamics of speech and gaze during a cooperative as¬ 
sembly task with physical interaction. To the best of 
our knowledge, this is the first work analyzing social 
signals during a cooperative assembly task with a hu¬ 
manoid robot. 

6.7 Limits of the study 

The present study brings significant new results to the 
field of human-robot interaction and engagement. How¬ 
ever, we discuss the limitations of our study. 

6. 7 .1 Ordinary people 

In our study, participants interacting with the robot are 
not experts “robot-users”. Our findings could change if 
we considered people with different levels of exposure 
to robotics and technology and expertise with iCub or 
other robots. Our intuition is that the prior exposure 
to robotics is likely to appear in the dynamics of verbal 
and non-verbal signals. This question is currently under 
investigation. 

6.7.2 HHI vs HRI 

It would have been interesting to have a control study 
about human-human physical interaction for the same 
assembly task. This kind of study would enable to com¬ 
pare if the dynamics of the social signals emitted by the 
human change when interacting with a human or with 
a robot during a physical collaboration, in function of 
the individual factors of the human. However, the same 
experiment done by two humans would have been too 
different in our view, and not only because the engage¬ 
ment of human-human and human-robot interaction is 
different. In our experiment, iCub is a child-like robot, 
and the task is very simple: it would have been dif¬ 
ficult to make it engaging for two adults, and would 
have made sense to do it with an adult and a child. 
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However, the child should have been constrained to be 
basically not too reactive. We actually did, in a prelimi¬ 
nary investigation, record the assembly task performed 
by a father and his child, two sisters (one older than 
the other) and two adults. Despite our instructions to 
the children, we found very difficult to reproduce the 
experiment with similar conditions to the ones of the 
HRI experiment. For example, it was difficult for one 
to not to react to the action of the other: we observed 
anticipatory gaze, joint gaze, anticipatory movements 
of the arms before and during the kinesthetic teach¬ 
ing, etc. These mechanisms were not implemented in 
our robot. Empathy, personality traits and other fac¬ 
tors linked to the human partner acting as the robot 
should also have been taken into account. 

6.7.3 Generalization 

In this study, we focused on an assembly task requir¬ 
ing physical interaction. The situation addressed in this 
study is extremely relevant to the robotics community 
and particularly to those studying collaborative robots 
and robotic co-workers. It is difficult to predict whether 
our results can be generalized to other tasks. This ques¬ 
tion is currently under investigation. 

6.7.4 Human-like and child-like appearance 

Another limit of our study is given by the human¬ 
like appearance of the robot, which may influence the 
production of social signals. This question was equally 
raised in other studies with human-like robots, for ex¬ 
ample by Huang and Thomaz with the Meka robot [23]. 
As we already remarked in our previous studies with 
iCub [26] , the anthropomorphic appearance may induce 
a biased perception of the robot and ultimately influ¬ 
ence the dynamics of speech and gaze, especially the one 
directed towards the robot face. However, differently 
from the previous study, before the experiments we told 
the participant that the robot had a limited knowledge 
and they had to teach the robot how to build the object. 
As their expectations about the robot intelligence were 
lower, their subjective evaluation of the robot resulted 
to be globally more positive than the one of the previ¬ 
ous experiment (see Table [ 2 ]). The type of task could 
also play a role: here the participant had a very close 
interaction with the robot, and had to use the hands 
of the robot for building an object. The task implies 
manipulation skills and cognitive skills that are usually 
attributed to humans and intelligent agents. For exam¬ 
ple, learning to “align” the cylinders means learning 
the proper arm movements but also understanding the 
concept of “to align” and “to assemble an object made 


by two parts.” Some participants were so engaged with 
the robot and the task that spent time to make sure 
that the robot could learn these concepts, showing the 
assembly gesture before engaging the kinesthetic teach¬ 
ing, and showing the final object explaining the result 
of the action after the kinesthetic teaching (see Figure 
[8|. It is also possible that the child-like appearance of 
the robot facilitated the emergence of these behaviors. 
However, we did not consider in our study the attitude 
towards children or having children as possible individ¬ 
ual factors: this is a limitation of the study. 

Would the results be different with another type 
of robot? For example a collaborative industrial robot 
without an anthropomorphic head? We are currently 
investigating this question. 


7 Conclusions 

In this paper we reported on the influence of extro¬ 
version and attitude towards robots on the temporal 
dynamics of social signals (i.e., gaze toward the robot’s 
face and speech), during a human-robot interaction task, 
where a human must physically cooperate with a robot 
to assemble an object. 

We conducted the experiments with the humanoid 
robot iCub and N=56 adult participants. We focused 
on extroversion and negative attitude towards robots, 
and reported on their effect on gaze and speech. 

We found that the more people are extrovert, the 
more they tend to talk and longer to the robot. We also 
found that the more people have a negative attitude 
towards robots, the less they tend to look at the robot’s 
face. 

The assembly task entailed a physical contact be¬ 
tween the human and the robot: we found that the 
more people have a negative attitude towards robots, 
the more they look at the area where the physical con¬ 
tacts occurred and the assembly task was executed (in 
this case, the robot’s hands). 

Our results provide evidence that among the met¬ 
rics classically used in human-robot interaction to es¬ 
timate engagement m, one should also take into ac¬ 
count inter-individual factors such as extroversion and 
attitude towards robots, because these individual fac¬ 
tors influence the dynamics of social signals, hence the 
dynamics of the interaction. Furthermore, we highlight 
a potential risk for the classical models of engagement, 
that do not provide a solution to the problem of decou¬ 
pling the engagement towards the robot and the en¬ 
gagement towards the task. These two are not easily 
discernible from the study of social signals, for many 
cooperative tasks. 
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To summarize, we propose an original way to deal 
with engagement and social signals during HRI: with 
a more comprehensive and multidisciplinary approach, 
we explicitly consider the exchanged social signals and 
the individual factors that may influence the produc¬ 
tion of such signals. Particularly, we do not only con¬ 
sider the personality traits of the humans, but also their 
attitudes towards robots that may be critical for their 
behavior during the interaction with a robot. 

The influence of personality traits on social signals 
should be taken into account if we wish to build robots 
capable of automatically estimating the engagement of 
the human partner - in tasks with or without physi¬ 
cal interaction. Of course, other dimensions should be 
investigated, for instance individual traits (e.g., other 
personality dimensions from the Big-Five m, such as 
openness or neuroticism), social attitudes or environ¬ 
mental and contextual factors. Recent studies show that 
it is possible to retrieve personality traits online from 
audio or video streams m • It will be therefore feasi¬ 
ble to pair the personality estimation with the social 
signals analysis, to provide better models of human en¬ 
gagement. Such models will be critical to adapt the 
robot’s behavior to the single individual reaction. 

Our insights can play an important role for letting 
the robot adapt its behavior to the human response. For 
example, to re-engage the dis-engaged partner into a co¬ 
operation by means of relevant social signals or physical 
actions. 

8 Questionnaire for negative attitude towards 
robots (NARS) 

See Table [6] for the questions in English and French. 

9 Questionnaire for post-experimental 
evaluation of the assembly task 

See Table [7] for the questions in English and French. 

10 Software for operating the robot 

The WoZ GUI was organized in several tabs, each ded¬ 
icated to a specific task, such as controlling the robot 
movements (gaze, hands movements, posture), its speech, 
its face expressions etc. The GUI events are elaborated 
by the actionServer module and others developed by the 
authors in previous studies I2SEI- All the developed 
software is open sourcj^j 

16 See download instructions at http://eris.liralab.it/ 
wiki/UPMC_iCub_proj ect/MACSi_Software 


Figure [TIJ-A shows the tab related to the control 
of head gaze and hands movements. It is designed to 
control the gaze direction in the Cartesian space, with 
relative movements with respect to the fixation posi¬ 
tion (joints at zero degrees in both eyes and neck). The 
hands can be quickly controlled by a list of available 
pre-defined grasps, plus primitives for rotating the palm 
orientation (towards the ground, skywards, facing each 
other). It is also possible to control the hand position 
and orientation in the Cartesian space, providing rel¬ 
ative movements with respect to the current position 
with respect to the Cartesian base frame of the robot 
(the origin located at the base of the torso, with x-axis 
pointing backward, y-axis pointing towards the right 
side of the robot and z-axis pointing towards the robot 
head). Some buttons allow the operator to control the 
whole posture of the robot and bring it back to pre¬ 
defined configurations. Figure [TT}b shows the part of 
the GUI dedicated to switching the control mode of the 
arms: position, zero-torque, then impedance with high, 
medium and low stiffness. The default values of the 
module demoForceControf^\ for stiffness and damping 
were used. During the experiments, the arms were con¬ 
trolled in the “medium compliance” impedance mode, 
which allows the robot to exhibit a good compliance 
in case of unexpected contacts with the human partici¬ 
pant. When the participant had grabbed the robot arms 
to start the teaching movement, the operator switched 
the control to zero-torque, which made the arms move 
under the effect of the human guidance. Figure [12]- A 
shows the tab related to the robot’s speech. It is de¬ 
signed to quickly choose choose one among a list of 
pre-defined sentences and expressions, in one of the sup¬ 
ported languages (currently French or English). It is 
also possible to generate new sentences, that can be 
typed on-the-fly by the operator: this is done to allow 
the operator to quickly formulate an answer to an un¬ 
expected request of the participant. The operator can 
switch between the supported languages, but of course 
in the experiments of this paper the robot was always 
speaking French (as all the participants were native 
french speakers). The text-to-speech in English is gen¬ 
erated by the festival library, while in French by the 
Pico library. Figure [12]- B shows the tab related to fa¬ 
cial expressions. The list of facial expressions along with 
their specific realization on the iCub face (the combina¬ 
tion of the activation of the LEDs in eyelids and mouth) 
is loaded from a configuration file that was designed by 
the experimenter. 


17 https://github.com/robotology/icub-basic-demos/ 
tree/master/demoForceControl 
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Negative Attitude Towards Robots Questionnaire (NARS) 


N. 

Questionnaire Item in English 

Questionnaire Item in French 

Subscale 

1 

I would feel uneasy if robots really had emotions. 

Je me sentirais mal a l’aise si les robots avaient 
reellement des emotions. 

S2 

2 

Something bad might happen if robots developed 

Quelque chose de mauvais pourrait se produire si 

S2 


into living beings. 

les robots devenaient des etres vivants. 


3 

I would feel relaxed talking with robots. 

Je serais detendu(e) si je parlais avec des robots. 

S3* 

4 

I would feel uneasy if I was given a job where I had 

Je me sentirais mal a l’aise dans un travail ou je 

SI 


to use robots. 

devrais utiliser des robots. 


5 

If robots had emotions, I would be able to make 

Si les robots avaient des emotions, je serai capable 

S3 


friends with them. 

de devenir ami(e) avec eux. 


6 

I feel comforted being with robots that have emo¬ 

Je me sens reconforte(e) par le fait d’etre avec des 

S3* 


tions. 

robots qui ont des emotions. 


7 

The word “robot” means nothing to me. 

Le mot “robot” ne signifie rien pour moi. 

SI 

8 

I would feel nervous operating a robot in front of 

Je me sentirais nerveux/nerveuse de manoeuvrer un 

SI 


other people. 

robot devant d’autres personnes. 


9 

I would hate the idea that robots or artificial intel¬ 

Je detesterais que les robots ou les intelligences ar- 

SI 


ligences were making judgments about things. 

tificielles fassent des jugements sur des choses. 


10 

I would feel very nervous just standing in front of 

Le simple fait de me tenir face a un robot me 

SI 


a robot. 

rendrait tres nerveux/nerveuse. 


11 

I feel that if I depend on robots too much, some¬ 

Je pense que si je dependais trop fortement des 

S2 


thing bad might happen. 

robots, quelque chose de mauvais pourrait arriver. 


12 

I would feel paranoid talking with a robot. 

Je me sentirais parano'iaque de parler avec un 
robot. 

SI 

13 

I am concerned that robots would be a bad influ¬ 

Je suis preoccupe(e) par le fait que les robots puis- 

S2 


ence on children. 

sent avoir une mauvaise influence sur les enfants. 


14 

I feel that in the future society will be dominated 

Je pense que dans le futur la societe sera dominee 

S2 


by robots. 

par les robots. 



* = reverse item 

Table 6 NARS questionnaire for evaluating the negative attitude towards robots. The order of the questions follows the 
original questionnaire, proposed by Nomura et al. in I35| . The second column reports the original questions in English. The 
third column reports our double translation of the questions in French. 


Post-experimental questionnaire for evaluation of the human-humanoid collaborative tasks with physical in¬ 
teraction_ 


N. 

Questionnaire Item in English 

Questionnaire Item in French 

1 

The assembly task was easy to do. 

La tache de constructions etait facile a faire. 

2 

The assembly task was interesting to do. 

La tache de construction etait interessante a faire. 

3 

I was worried to must touch the robot to assembly the 

J’etais inquiet (e) de devoir toucher le robot pour con- 


objects with it. 

struire les choses avec lui. 

4 

During the assembly, I would have preferred that the 

Pendant la construction, j’aurais prefere que le robot 


robot tells me what it thinks, if it understands well. 

m’informe de ce qu’il pense, s’il comprend bien. 

5 

I was afraid to touch the hands of the robot. 

J’avais peur de toucher les mains du robot. 

6 

I was afraid to damage the robot. 

J’avais peur d’abimer le robot. 

7 

The robot was nice. 

Le robot etait sympathique. 

8 

The robot understood what I explained to it. 

Le robot a compris ce que je lui ai explique. 

9 

The robot answers to questions too slowly. 

Le robot reponds aux questions trop lentement. 

10 

The robot moves its head too slowly. 

Le robot bouge la tete trop lentement. 

11 

The robot moves its arms too slowly. 

Le robot bouge les bras trop lentement. 

12 

The robot should be more reactive. 

Le robot devrait etre plus reactif. 

13 

The facial expressions of the robot trouble me. 

Les expressions faciales du robot me genent. 

14 

The voice of the robot is pleasant. 

La voix du robot est agreable. 

15 

The robot is not threatening. 

Le robot n’est pas menacant. 

16 

The robot does not look dangerous. 

Le robot ne semble pas dangereux. 

17 

Someday I could work with this robot to build some¬ 

Un jour, je pourrais travailler avec this robot pour con- 


thing of interest 

struire quelque chose d’interessant 

18 

Someday I could work with a robot to build something 

Un jour, je pourrais travailler avec un robot pour con- 


of interest 

struire quelque chose d’interessant 


Table 7 Post-experimental questionnaire for evaluating the perception and interaction with the iCub in the assembly task of 
this work. The third column reports the original questions in French (the participants were all native French speakers). The 
second column reports our double translation of the questions in English. 
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Status Action Complex Action Speech Expression TaskPlayer Playlist Play 
What i do 


Skyward 
Middle Rotation 
To the ground 
Open Hand 


l.1 ] 

1 

/ 

<- 

Look Front 

1 _> J 

7 

1 

\ 


l°.l. 

[m] meters 

Left Hand 

I ' 

1 

/ 


Forward 

Back 

1 -> 

/ 

1 

\ 



Close bottle small Close bottle small 
Close bottle big Close bottle big 
Close cylinder constru Close cylinder constru 


Right Hand 
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Fig. 11 WoZ GUI. A: the tab dedicated to the quick control of gaze, grasps and hands movements in the Cartesian space. 
The buttons sends pre-defined commands to the actionsServer module, developed in EZI- The buttons of the bottom row 
allows the operator to bring the robot in pre-defined postures (whole-body joint configurations): they were pre-programmed 
so as to simplify the control of the iCub during the experiments, in case the operator had to “bring it back” to a pre-defined 
configuration that could simplify the interaction for the participants. They were useful also for prototyping and testing of the 
experiments. B: part of the GUI dedicated to switching the control mode of the arms - position, zero-torque, then impedance 
control with low, medium and high stiffness. 
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Fig. 12 WoZ GUI. A: the tab related to the robot’s speech. The operator can choose between a list of pre-defined sentences and 
expressions, or he can type a new sentence on-the-fly: this is done to be able to quickly formulate an answer to an unexpected 
request of the participant. The operator can switch between french and english speech (at the moment, the only two supported 
languages), even if in the experiments of this paper of course the robot was always speaking french. B: the tab related to 
facial expressions. The list of facial expression along with their specific realization on the iCub face (the combination of the 
activation of the LEDs in eyelids and mouth) is loaded from a configuration file. 
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